Creating hidden Markov models for fast speech by optimized clustering

Authors

Robert Faltlhauser

Thilo Pfau

Günther Ruske

Abstract

Previous studies have shown that recognition accuracy often degrades severely at higher speech rates, which can basically be traced back to two main dimensions: acoustic and phonemic. Reasons for this effect can be found in the phonemic field (e.g., elisions) as well as on the acoustic level: with increasing rates of speech, the spectral characteristics change. A main obstacle in this context is the training data, which contains only a small fraction of samples that can be labeled as fast. Therefore, the effects caused by an increased speech rate often cannot be completely covered. To address this problem, this paper presents an optimized clustering process that makes efficient use of the available data. Our modified mixture splitting algorithm, with an incorporated cross-validation step, aims at increasing the generalization of hidden Markov models, especially with respect to fast speech. Experimental results showed a relative decrease in word error rate for fast speech.
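The abstract does not spell out the algorithm, so the following is only a rough, generic sketch of the idea it names: grow a Gaussian mixture by splitting components, and accept each split only if the likelihood of held-out data improves. Everything here is an assumption made for illustration, not the authors' method: the 1-D pure-Python setting, the choice to split the heaviest component, the perturbation size `eps`, the stopping rule, and all function names.

```python
import math
import random

def gauss_pdf(x, mean, var):
    """Density of a 1-D Gaussian."""
    return math.exp(-0.5 * (x - mean) ** 2 / var) / math.sqrt(2.0 * math.pi * var)

def avg_log_likelihood(data, model):
    """Average per-sample log-likelihood of data under a mixture."""
    weights, means, variances = model
    total = 0.0
    for x in data:
        p = sum(w * gauss_pdf(x, m, v) for w, m, v in zip(weights, means, variances))
        total += math.log(max(p, 1e-300))
    return total / len(data)

def em_step(data, model, var_floor=1e-3):
    """One EM re-estimation pass over the training data."""
    weights, means, variances = model
    k = len(weights)
    occ, sx, sxx = [0.0] * k, [0.0] * k, [0.0] * k
    for x in data:
        dens = [w * gauss_pdf(x, m, v) for w, m, v in zip(weights, means, variances)]
        norm = max(sum(dens), 1e-300)
        for j in range(k):
            g = dens[j] / norm          # responsibility of component j for x
            occ[j] += g
            sx[j] += g * x
            sxx[j] += g * x * x
    occ = [max(o, 1e-12) for o in occ]  # guard against empty components
    total = sum(occ)
    new_w = [o / total for o in occ]
    new_m = [sx[j] / occ[j] for j in range(k)]
    new_v = [max(sxx[j] / occ[j] - new_m[j] ** 2, var_floor) for j in range(k)]
    return new_w, new_m, new_v

def split_heaviest(model, eps=0.2):
    """Split the component with the largest weight into two perturbed copies."""
    weights, means, variances = (list(p) for p in model)
    j = max(range(len(weights)), key=lambda i: weights[i])
    shift = eps * math.sqrt(variances[j])
    half, mean, var = weights[j] / 2.0, means[j], variances[j]
    weights[j], means[j] = half, mean - shift
    weights.append(half)
    means.append(mean + shift)
    variances.append(var)
    return weights, means, variances

def train_with_validation(train, held_out, max_components=8, em_iters=20, eps=0.2):
    """Grow the mixture; keep a split only if held-out likelihood improves."""
    mean = sum(train) / len(train)
    var = max(sum((x - mean) ** 2 for x in train) / len(train), 1e-3)
    best = ([1.0], [mean], [var])
    best_score = avg_log_likelihood(held_out, best)
    while len(best[0]) < max_components:
        cand = split_heaviest(best, eps)
        for _ in range(em_iters):
            cand = em_step(train, cand)
        score = avg_log_likelihood(held_out, cand)
        if score <= best_score:         # split did not generalize: reject and stop
            break
        best, best_score = cand, score
    return best, best_score

# Toy usage on synthetic 1-D data (not speech features).
random.seed(0)
train = [random.gauss(-3.0, 0.5) for _ in range(200)] + [random.gauss(3.0, 0.5) for _ in range(200)]
held_out = [random.gauss(-3.0, 0.5) for _ in range(50)] + [random.gauss(3.0, 0.5) for _ in range(50)]
model, score = train_with_validation(train, held_out, max_components=4)
print(len(model[0]), "components, held-out log-likelihood per sample:", score)
```

Scoring each candidate on held-out data rather than on the training set is what keeps the model from adding components that only fit the few available samples. How the paper partitions its data, and how it treats the fast-speech portion, is not stated in the abstract.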


Similar resources

Speaker Independent Speech Recognition Using Hidden Markov Models for Persian Isolated Words

Speech enhancement based on hidden Markov model using sparse code shrinkage

This paper presents a new hidden Markov model-based (HMM-based) speech enhancement framework based on independent component analysis (ICA). We propose analytical procedures for training clean speech and noise models by the Baum re-estimation algorithm and present a maximum a posteriori (MAP) estimator based on a Laplace-Gaussian combination (for clean speech and noise, respectively) in the HMM ...

Syllable-length path mixture hidden Markov models with trajectory clustering for continuous speech recognition

Recent research suggests that modeling coarticulation in speech is more appropriate at the syllable level. However, due to a number of additional factors that can affect the way syllables are articulated, creating multiple acoustic models per syllable might be necessary. Our previous research on longer-length multi-path models has shown data-driven trajectory clustering to be an attractiv...

Phone set selection for HMM-based dialect speech synthesis

This paper describes a method for selecting an appropriate phone set in dialect speech synthesis for a so far undescribed dialect by applying hidden Markov model (HMM) based training and clustering methods. In this pilot study we show how a phone set derived from the phonetic surface can be optimized given a small amount of dialect speech training data.

Publication date: 1999
